The distance correlation t-test of independence in high dimension
نویسندگان
چکیده
AMS subject classifications: primary 62G10 secondary 62H20 Keywords: dCor dCov Multivariate independence Distance covariance Distance correlation High dimension a b s t r a c t Distance correlation is extended to the problem of testing the independence of random vectors in high dimension. Distance correlation characterizes independence and determines a test of multivariate independence for random vectors in arbitrary dimension. In this work, a modified distance correlation statistic is proposed, such that under independence the distribution of a transformation of the statistic converges to Student t, as dimension tends to infinity. Thus we obtain a distance correlation t-test for independence of random vectors in arbitrarily high dimension, applicable under standard conditions on the coordinates that ensure the validity of certain limit theorems. This new test is based on an unbiased es-timator of distance covariance, and the resulting t-test is unbiased for every sample size greater than three and all significance levels. The transformed statistic is approximately normal under independence for sample size greater than nine, providing an informative sample coefficient that is easily interpretable for high dimensional data. 1. Introduction Many applications in genomics, medicine, engineering, etc. require analysis of high dimensional data. Time series data can also be viewed as high dimensional data. Objects can be represented by their characteristics or features as vectors p. In this work, we consider the extension of distance correlation to the problem of testing independence of random vectors in arbitrarily high, not necessarily equal dimensions, so the dimension p of the feature space of a random vector is typically large. measure all types of dependence between random vectors in arbitrary, not necessarily equal dimensions. (See Section 2 for definitions.) Distance correlation takes values in [0, 1] and is equal to zero if and only if independence holds. It is more general than the classical Pearson product moment correlation, providing a scalar measure of multivariate independence that characterizes independence of random vectors. The distance covariance test of independence is consistent against all dependent alternatives with finite second moments. In practice, however, researchers are often interested in interpreting the numerical value of distance correlation, without a formal test. For example, given an array of distance correlation statistics, what can one learn about the strength of dependence relations from the dCor statistics without a formal test? This is in fact, a difficult question, but a solution is finally available for a large class of problems. The …
منابع مشابه
The Relationship between intercanthal dimension and width of maxillary anterior teeth
The Relationship between intercanthal dimension and width of maxillary anterior teeth Dr. MH. Lotfi Kamran *- Dr. A. Haghparast - Ghomsheh ** *Assistant Professor of Prosthodontics Dept., Faculty of Dentistry, Shaheed Sadoughi University of Medical Sciences. ** Dentist. Abstract Background and Aim: One of the difficult aspects of designing complete denture prosthesis is the selection of appropr...
متن کاملThe Recognition and Ranking of the Dimensions of University Autonomy with Reason and Cause Approaches for the Efficiency of Islamic Azad Universities
The general purpose of this study was to identify and rank the dimensions of academic independence with causal and causal approaches in Islamic Azad universities of Mazandaran province. This research was applied in terms of purpose and descriptive in terms of method. The statistical population consisted of all faculty members of Islamic Azad universities of Mazandaran province in the number of ...
متن کاملHigh Dimensional Correlation Matrices: CLT and Its Applications
Statistical inferences for sample correlation matrices are important in high dimensional data analysis. Motivated by this, this paper establishes a new central limit theorem (CLT) for a linear spectral statistic (LSS) of high dimensional sample correlation matrices for the case where the dimension p and the sample size n are comparable. This result is of independent interest in large dimensiona...
متن کاملRelationship between Nurses\' Work Effectiveness and Their Work Engagement
Background: Nurses play an important role in the healthcare system and constitute the highest number of healthcare workers. A positive attitude or a positive state of mind at work may increase the ability to carry out the organization's duties, tasks and plans. The aim of this study was to investigate the relationship between work effectiveness and work engagement of nurses working in the selec...
متن کاملCardiac Troponin T Response during High-Intensity Competition and its Correlation with the Corrected QT Interval among the Trained Athletes
Introduction: It is highly crucial to develop novel strategies in order to further enhance the monitoring of sudden cardiac arrest in athletes and establish standard medical care procedures to control monitoring and treatment costs. Therefore, this study aimed to investigate the effects of a high-intensity competition on cardiac troponin T (cTnT) and its correlation with the corrected QT interv...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- J. Multivariate Analysis
دوره 117 شماره
صفحات -
تاریخ انتشار 2013